
PI 1191B19 




UNITED STATES DEPARTMENT OF COMMERCE 
United States Patent and Trademark Office 



July 08, 2004 

THIS IS TO CERTIFY THAT ANNEXED HERETO IS A TRUE COPY FROM 
THE RECORDS OF THE UNITED STATES PATENT AND TRADEMARK 
OFFICE OF THOSE PAPERS OF THE BELOW IDENTIFIED PATENT 
APPLICATION THAT MET THE REQUIREMENTS TO BE GRANTED A 
FILING DATE. 

APPLICATION NUMBER: 60/485,301 
FILING DATE: July 05, 2003 

RELATED PCT APPLICATION NUMBER: PCT/US04/15587 




By Authority of the 

COMMISSIONER OF PATENTS AND TRADEMARKS 



it™ 




... 0y -< 




M. SIAS 
Certifying Officer 

PRIORITY 
DOCUMENT 

SUBMITTED OR TRANSMITTED IN 
COMPLIANCE WITH RULE 17.1(a) OR (b) 




PTQfSB/16 (10-01) 
Approved for use through 10/31/2002. OMB 0651-0032 
, Patent and Trademark Office: U.S. DEPARTMENT OF COMMERCE 

Under the Paperwork Reduction Act of 1995. no persons are required to respond to a collection d information unless U displays a valid OMB control number. 

PROVISIONAL APPLICATION FOR PATENT COVER SHEET 
This Is a request for filing a PROVISIONAL APPLICATION FOR PATENT under 37 CFR 1 -S3 (c). 
| Express Malt Label No. 



tNVENTOR(S) 



Given Name {first and middle [if any]) 



Family Name or Surname 



Residence 
(City and either State or Foreign Country) 



Devin 
Hai 



Dressman 
Yan 



Baltimore. MD 
Baltimore, MD 



in 



HI Additional Inventors are being named on the j separately numbered sheets attached hereto 



TITLE OF THE INVENTION (500 characters max) 



METHOD AND COMPOSITIONS FOR DETECTION AND ENUMERATION OF GENETIC VARIATIONS 



Direct alt correspondence to: 
3 Customer Number 
OR 



22907 



CORRESPONDENCE ADDRESS 
— 1 ► 



Type Customer Number here 



Place Customer Number 
Bar Code Label here 



Finn or 

Individual Name 



Address 



Address 



City 



State 



ZIP 



Country 



Telephone 



Fax 



ENCLOSED APPLICATION PARTS (check all that apply) 



Specification Number of Pages * LlL 
IS Drawing(s) Number of Sheets \6 



□ CD/s). Number 1 [ 



E><3 Other (specify) |1 page Supplementary Tab 



Q Application Data Sheet. See 37 CFR 1.76 



METHOD OF PAYMENT OF FILING FEES FOR THIS PROVISIONAL APPLICATION FOR PATENT 



(23 Applicant claims small entity status. See 37 CFR 1.27. 
O A check or money order is enclosed to cover the filing fees 

[>2 The Commissioner is hereby authorized to charge filing 

fees or credit any overpayment to Deposit Account Number: 
□ Payment by credit card. Form PTO-2038 is attached. 



FILING FEE 
AMOUNT ($) 



19-Q733 



The invention was made by an agency of the United States Government or under a contract with an agency of 
the United States Government. 

□ No. 

El Yes, the name of the U.S. Government agency and the Government contract number are: National Institutes of Health CA 
43460. CA 57345. and CA62924 . 



Respectfully submin 
SIGNATURE 



Date 107/05/03 1 



TYPED or PRINTED NAME Sarah A. Kagan 
TELEPHONE 202 508 9100 



REGISTRATION NO. 
(if appropriate) 
Docket Number: 



| 32,141 ~| 



001107.00356 



USE ONLY FOR FILING A PROVISIONAL APPLICATION FOR PATENT 




the complete provisional application to the PTO. Time win vary depending upon Ihe individual case. Any comments on the amount of time you require to complete-mis 
form and/or suggestions for reducing this burden, should be Bent to the Chief Information Officer. U.S. Patent and Trademark Office. U.S. Department of Commerce, 
Washington. D.C.. 20231. DO NOT SEND FEES OR COMPLETED FORMS TO THIS ADDRESS. SEND TO. Box Provisional Application. Assistant Commissioner for 
Patents. Washington, D.C. 20231. 



PROVISIONAL APPLICATION COVER SHEET 

Additional Page 

PTO/SB/16 (02-O1) 
Approved for use through 10/31/2002. 0MB 0651-0032 
Patent and Trademark Office: U.S. DEPARTMENT OF COMMERCE 
Under the Paperwork Reduction Aci of 1995, no parsons ere required to respond to a collection of information unless U displays a valid OMB control number. 



Docket Number 



001107.00356 



INVENTGR<S)/APPUCANT(S) 



Given Name {first and middle fif any]) 



Family or Surname 



Residence 
(City and either State or Foreign Country) 



Kenneth W. 
Bert 



Kinzler 
Vogelstein 



Bel Air,MD 
Baltimore. MD 



Number 2 of 2 



WARNING: Information on this form may become public. Credit card information should not 
be included on this form. Provide credit card information and authorization on PTO-2038. 



Patent Application 



Atty. Docket No.: 001 107.00356 



METHOD AND COMPOSITIONS FOR DETECTION AND ENUMERATION OF 

GENETIC VARIATIONS 



jOl j The invention disclosed herein was mad using funds from the National Institutes of 
Health grants CA 43460, CA 57345, and CA62924. The United States government 
therefore retains certain rights in the invention. 

102| A portion of the disclosure of this patent document contains material which is subject 
to copyright protection. The copyright owner has no objection to the facsimile 
reproduction by anyone of the patent document or the patent disclosure, as it appears 
in the Patent and Trademark Office patent file or records, but otherwise reserves all 
copyright rights whatsoever. 

FIELD OF THE INVENTION 

|03| The invention relates to the field of genetic analysis. In particular, it relates to 
methods and compositions for analyzing variations in individual genes or transcripts 
and separating variants. 

BACKGROUND OF THE INVENTION 

(041 The study of DNA sequence variation is essential for many areas of research. The 
study of germ-line variations is essential for assessing the role of inheritance in 
normal and abnormal physiologic states (I). Other variations, developed somatically, 
are responsible for neoplasia (2). The identification of such mutations in urine, 
sputum, and stool can therefore be used for the detection of presymptomatic cancers 
(3-5). Similarly, the detection of somatic mutations in lymph nodes, blood, or bone 
marrow can provide data about the stage of disease, prognosis, and appropriateness of 
various therapies (5). Somatic mutations in non-neoplastic cells also occur and 
appear to accumulate as humans age or are exposed to environmental hazards (6). 
Such mutations occur in only a small fraction of the cells in a tissue, thereby 
complicating their analysis. 
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[051 Central to the investigation of many of these issues is the detection and quantification 
of sequence variants within a population of DNA molecules. The number of 
molecules in each such collection is finite and therefore countable. Consider, for 
example, a collection of red and green balls. Counting these balls is simple in 
principle but subject to basic probability theory. If there is only one red ball for every 
500 green balls, then it is necessary to count several thousand balls to get an accurate 
estimate of the proportion of red balls. If it is difficult to count enough balls to make 
a reliable estimate, one can elute the paint off all the balls and measure the color of 
the resultant paint mix. 

1061 In analogous fashion, small numbers of DNA molecules that vary by subtle changes 
(single base pair substitutions or small deletions or insertions) can be directly counted 
by amplifying individual DNA molecules (single molecule PCR) (7-12). Such digital 
techniques have been shown to be extremely useful for measuring variation in genes 
or their transcripts. But digital technologies have so far been limited to counting tenb 
to thousands of molecules, either in the wells of microtiter plates, on microscope 
slides, or after electrophoresis of individual PCR products. Analog techniques, 
analogous to the elution of paint from the balls described above, are generally easier 
to implement and can assess millions of molecules simultaneously (13). However, 
their accuracy and sensitivity is limited by instrumental and experimental noise. 
There is a continuing need in the art for methods which are accurate and sensitive for 
measuring variation in genes or their transcripts. 

BRIEF SUMMARY OF THE INVENTION 

(071 In a first embodiment of the invention a composition is provided. The composition 
comprises a plurality of beads. Each of the plurality of beads comprises a plurality of 
bound polynucleotides. The polynucleotides in the composition are heterogeneous; 
however, on at least I % of said beads the plurality of bound polynucleotides is 
homogeneous. 
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1 08 1 In a second embodiment of the invention a liquid composition is provided. The liquid 
composition comprises a plurality of microemulsions forming aqueous compartments. 
At least a portion of said aqueous compartments comprise a bead, a polynucleotide 
template, and oligonucleotide primers for amplifying the template. At least a portion 
of the oligonucleotide primers is bound to the bead. 

1091 A third embodiment of the invention provides a method for analyzing nucleotide 
sequence variations. Microemulsions comprising one or more species of analyte 
DNA molecules are formed. The analyte DNA molecules in the microemulsions are 
amplified in the presence of reagent beads which are bound to a plurality of molecules 
of a primer for amplifying the analyte DNA molecules. Product beads are formed that 
are bound to a plurality of copies of a single species of analyte DNA molecule. The 
product beads are separated from analyte DNA molecules which are not bound to 
product beads. A sequence feature of the single species of analyte DNA molecule 
that is bound to the product beads is determined. 

J101 A fourth embodiment of the invention is a probe for use in hybridization to a 
polynucleotide that is bound to a solid support. The probe comprises an 
oligonucleotide with a stem-loop structure. At one of the 5' or 3' ends there is a 
photoluminescent dye. The oligonucleotide does not comprise a quenching agent at 
the opposite 5' or 3* end. 

|11 1 A fifth embodiment of the invention is a pair of molecular probes. The first and 
second probes each comprise an oligonucleotide with a stem-loop structure having a 
first photoluminescent dye at one of the 5' or 3' ends, and not comprising a quenching 
agent at the opposite 5' or 3' end. The first oligonucleotide hybridizes to a wild-type 
selected genetic sequence better than to a mutant selected genetic sequence and the 
second oligonucleotide hybridizes to the mutant selected genetic sequence better than 
to the wild-type selected genetic sequence. The first and the second photoluminescent 
dyes are distinct. 
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|12] In a sixth embodiment of the invention a method is provided for isolating nucleotide 
sequence variants. Microemulsions comprising one or more species ofanalyte DNA 
molecules are formed. Analyte DNA molecules in the microemulsions are amplified 
in the presence of reagent beads. The reagent beads are bound to a plurality of 
molecules of a primer for amplifying the analyte DNA molecules. Product beads are 
formed which are bound to a plurality of copies of one species of analyte DNA 
molecule. The product beads are separated from analyte DNA molecules which are 
not bound to product beads. The product beads which are bound to a plurality of 
copies of a first species of analyte DNA molecule are isolated from product beads 
which are bound to a plurality of copies of a second species of analyte DNA 
molecule. 

[13| These and other embodiments of the invention, which will be apparent from the entire 
description of the invention, provide the art with the ability to quantify genetic 
variations at a scale and ease heretofore unattainable. 

BRIEF DESCRIPTION OF THE DRAWINGS 

1 141 Fig- 1 is a schematic drawing of the BEAMing method. Step 1- Magnetic beads 
covalently coated with streptavidin are bound to biotinylated oligonucleotides 
("oligos"). Step 2 - An aqueous mix containing all the necessary components for 
PCR plus primer-bound beads and template DNA are stirred together with an 
oil/detergent mix to create microemulsions. The aqueous compartments (white circles 
in the gray oil layer) contain an average of <1 template molecule and <l bead. Red 
and green templates represent two template molecules whose sequences differ by one 
or many nucleotides. Step 3 - The microemulsions are temperature cycled as in a 
conventional PCR. If a DNA template and a bead are present together in a single 
aqueous compartment, the bead bound oligonucleotides act as primers for 
amplification. The straight red and green lines connected to the beads represent 
extension products from the two different kinds of templates. Step 4 - The emulsions 
are broken and the beads are purified with a magnet. Step 5 - After denaturation, the 
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beads are incubated with oligonucleotides that can distinguish between the sequences 
of the different kinds of templates- Fluorescently-labeled antibodies are then used to 
label the bound hybridization probes. This renders the beads containing PGR product 
as red or green upon appropriate laser excitation. Step 6 - Flow cytometry is used to 
count the red and green beads. 

|15| Fig. 2 is a photograph of a typical microemulsion. Microemulsions were made as 
described infra with the exception that the aqueous compartments contained cascade 
blue-labeled dCTP and the beads were pre-labeled with phycoerythrin (red) or Alexa 
488 (green). One ul of microemulsion was deposited in 1 ul of oil on a microscope 
slide prior to photography. Of the seven aqueous compartments visible in this picture, 
two contain beads. Note the heterogeneous size of the aqueous compartments (beads 
are 1 .05 microns in diameter). 

|16| Fig. 3 A to Fig. 3D show density plots of flow cytometric data obtained from 
BEAMing. The locus queried in this experiment was M1D42 and PCR products 
generated from genomic DNA were used as templates in the microemulsions. (Fig. 
3 A) Forward scatter (FSC) and side scatter (SSC) of all beads show that -80% of the 
total beads are singlets, with most of the remaining beads aggregated as doublets. The 
"noise" is instrumental and is observed with blank samples containing no beads. The 
instrument output was gated so that only singlets were analyzed for fluorescence 
analysis. The patterns observed from an individual homozygous for the L allele (Fig. 
3C), homozygous for the S allele (Fig. 3B), and heterozygous for L and S (Fig. 3D) 
are shown. The regions containing beads hybridizing to the L and S allele probes are 
labeled green and red, respectively. The region containing beads that did not 
hybridize to any probe is black and the region containing beads that hybridized to 
both probes is blue. The blue beads arose from aqueous compartments in which both 
types of template molecules were present. The proportion of singlet beads that 
hybridized to at least one of the probes was 2.9% 43%, and 20.3% in (Fig. 3B) to 
(Fig. 3D), respectively. The FSC and SSC plots in (Fig. 3 A) represent the same beads 
analyzed in (Fig. 3D). 
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|I7| Fig. 4A to Fig. 4D show density plots of BEAMing using genomic DNA or RT-PCR 
products as templates. The data in (Fig. 4A) and (Fig. 4B) were generated by 
including 10 and I ug of human genomic DNA, respectively, in the microemulsions, 
querying the MID42 locus. The data in (Fig. 4C) and (Fig. 4D) were generated using 
emulsions that contained ~ 50 pg of PCR products synthesized from cDNA of 
lymphoblastoid cells, querying the calpain-10 locus. The green and red regions 
correspond to the L and S alleles for MID42 and to the A and G alleles for calpain-10. 
The number of beads in the outlined regions containing red or green beads is shown in 
each case. The proportion of singlet beads that hybridized to at least one of the 
probes was 1 .2%, 0.6%, 6,8% and 4.2% in (Fig. 4A) to (Fig. 4D)> respectively. The 
outlined regions used for counting in (Fig. 4A) and (Fig. 4B) were identical, as were 
those used for (Fig. 4C) and (Fig. 4D). Beads that did not hybridize to any probe 
were gated out and therefore not evident in the graphs, while the region containing 
beads that hybridized to both probes is labeled blue. 

1181 Fig. 5A to Fig. 5C show detection and validation of variants present in a minor 
fraction of the DNA population. (Fig. 5A) Mixtures of PCR products containing 0% 
to 4% L alleles of MID42 were used for BEAMing. Flow cytometry such as that 
shown in Fig. 3 was used to determine the fraction of singlet beads that were red (y- 
axis). The proportion of singlet beads that hybridized to at least one of the probes 
varied from 3.2% to 4.3%. (Fig. 5B and Fig. 5C) Beads were sorted with the FACS 
Vantage SE instrument and individual red or green beads were used as templates for 
conventional PCR employing the forward and reverse primers listed in Supplementary 
Table I. Red beads generated only the S allele sequence (Fig. 5B) while green beads 
generated only the L allele sequence (Fig. 5C). ( 

|19| Fig. 6 A to 6B demonstrate the use of agar in the aqueous phase of the 
microemulsions. Emulsion bubbles that were formed by including 1.5% agarose in 
the aqueous compartment are shown. Fig. 6A shows the bubbles that have 
fluorescents in them. Fig. 6B shows a darkfield image of the bubbles with one of the 
bubbles containing a bead in it After breaking the emulsions, the droplets containing 
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magnetic beads can be recovered by centrifugation and size fractionated through 
filtration or flow sorting. 



DETAILED DESCRIPTION OF THE INVENTION 

120] The inventors describe herein a digital technology, called BEAMing, that has the 
power to assess millions of molecules and can be generally applied to the study of 
genetic variation. The technology involves conversion of single DNA molecules to 
single beads each containing thousands of copies of the sequence of the original DNA 
molecule. The number of variant DNA molecules in the population can then be 
assessed, for example, by staining the beads with fluorescent probes and counting 
them using flow cytometry. Beads representing specific variants can be optionally 
recovered through flow sorting and used for subsequent confirmation and 
experimentation. 

[21 1 Beads according to the present invention are also known as microspheres or 
microparticles. Particle sizes can vary between about 0.1 and 10 microns in diameter. 
Typically beads are made of a polymeric material, such as polystyrene, although 
nonpolymeric materials such as silica can also be used. Other materials which can be 
used include styrene copolymers, methyl methacrylate, functionalized polystyrene, 
glass, silicon, and carboxylate. Optionally the particles are superparamagnetic, which 
facilitates their purification after being used in reactions. 

1 22 1 Beads can be modified by covalent or non-covalent interactions with other materials, 
either to alter gross surface properties, such as hydrophobicity or hydrophilicity, or to 
attach molecules that impart binding specificity. Such molecules include without 
limitation, antibodies, Hgands, members of a specific-binding protein pair, receptors, 
nucleic acids. Specific-binding protein pairs include avidin-biotin, streptavidin- 
biotin, and Factor Vll-Tissue Factor. 
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J23| Beads, after being prepared according to the present invention as product beads, have 
more than one copy of the same nucleic acid molecule bound to them. Preferably 
each bead is bound to at least 1 0, 50, 1 00, 500, or 1 000 molecules of the same nucleic 
acid sequence. In some circumstances some of the product beads are hound to more 
than one type of nucleic acid molecule. These product beads are generally less useful 
in the analysis of ratios of genetic sequences in a population of genetic sequences. 
Such product beads can be readily discriminated and so will not distort the analysis. 

1241 A population of product beads will often comprise two or more types of nucleic acids. 
Such a population is heterogeneous with respect to the nucleic acids. Desirably, a 
substantial proportion of the product beads comprise only one type of nucleic acid per 
bead. A substantial proportion can be for example, at least I %, at least 5 %, at least 
10 %, or at least 50 %. A product bead with only one type of nucleic acid per bead is 
termed homogeneous. Homogeneous beads with only one type of nucleic acid per 
bead include those with nucleic acids containing errors due to errors in polymerase 
chain reaction. A product bead with two types of nucleic acid per bead is termed 
heterogeneous. Although not wishing to be bound by any particular theory, 
heterogeneous product beads are thought to result from aqueous compartments which 
have more than two molecules of template of non-identical sequence. A population of 
product beads can be heterogeneous as a population but contain individual product 
beads that are homogeneous 

125J Individual product beads preferably comprise more than one copy of template analyte 
molecule. Each bead may comprise at least 10, at least 50, at least 100, at least 500, 
or at least 1 000 copies of template analyte. If the bead is homogeneous, each of those 
copies will be identical. 

J261 Populations of product beads can be maintained in a liquid suspension. Alternatively 
they can be sedimented and dried or frozen. The latter alternatives may be beneficial 
for storage stability. 
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|27| Analysis of populations of product beads can be useful for distinguishing between 
many kinds of genetic variants. Polynucleotides can be distinguished which differ by 
as little as a single nucleotide polymorphism (SNP), by the presence or absence of a 
mutation, by the presence or absence of an insertion, by the presence or absence of a 
non-single nucleotide polymorphism. Thus populations of product beads may be 
heterogeneous with regard to these genetic variations. 

128] One very convenient way for distinguishing genetic variants, /.*., determining a 
sequence feature of the analyte, is by differentially labeling the variants with 
fluorescent dyes. Such labeling can be accomplished by hybridization of a 
fluorescently labeled oligonucleotide probe to one species of polynucleotide. 
Alternatively, a fluorescently labeled antibody can be used to specifically attach to 
one oligonucleotide probe that hybridizes to a particular genetic variant. Such 
antibody binding can be, for example, mediated by a protein or polypeptide which is 
attached to an oligonucleotide hybridization probe. Of course, other means of 

labeling polynucleotides as are known in the art can be used without limitation. 

Another means of labeling different polynucleotide species is by primer extension. 

Primers can be extended using labeled deoxyribonucleotides, such as fluorescently 

labeled deoxyribonucleotides. 

129J Oligonucleotide primers can be bound to beads by any means known in the art. They 
can be bound covalently or non-covalently. They can be bound via an intermediary, 
such as via a protein-protein interaction, such as an antibody-antigen interaction or a 
biotin-avidin interaction. Other specific binding pairs as are known in the art can be 
used as well. To achieve optimum amplification, primers bound to the bead may be 
longer than necessary in a homogeneous, liquid phase reaction. Oligonucleotide 
primers may be at least 12, at least 1 5, at least 1 8, at least 25, at least 35, or at least 45 
nucleotides in length. The length of the oligonucleotide primers which are bound to 
the beads need not be identical to that of the primers that are in the liquid phase. 
Primers can be used in any type of amplification reaction known in the art, including 
without limitation, polymerase chain reaction, isothermal amplification, rolling circle 
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amplification, self-sustaining sequence replication (3SR), nucleic acid sequence-based 
amplification (NASBA), transcription-mediated amplification (TMA), strand- 
displacement amplification (SDA), and ligase chain reaction (LCR). 

|30| Microemulsions are made by stirring or agitation of oil, aqueous phase, and detergent. 
The microemulsions form small aqueous compartments which have an average 
diameter of 0.5 to 50 microns. The compartments may be from 1 to 10 microns, 
inclusive, from 1 1 to 100 microns, inclusive, or about 5 microns, on average. All 
such compartments need not comprise a bead. Desirably, at least one in 10,000 of 
said aqueous compartments comprise a bead. Typically from 1/100 to 1/1 or from 
1/50 to 1/1 of said aqueous compartments comprise a bead. In order to maximize the 
proportion of beads which are homogeneous with respect to oligonucleotide, it is 
desirable that on average, each aqueous compartment contains less than I template 
molecule. Aqueous compartments will also desirably contain whatever reagents and 
enzymes are necessary to carry out amplification. For example, for polymerase chain 
reaction (PCR) the compartments will desirably contain a DNA polymerase and 
deoxyribonucleotides. For rolling circle amplification a DNA polymerase and a 
generic DNA circle may be present. 

1311 Emulsions can be "broken" or disrupted by any means known in the art. One 
particularly simple way to break the emulsions is to add more detergent. Detergents 
which can be used include, but are not limited to Triton XI 00, Laureth 4, Nonidet. 

|32| Sample DNA for amplification and analysis according to the present invention can be 
genomic DNA, cDNA, PCR products of genomic DNA, or PCR products of cDNA, 
for example. Samples can be derived from a single individual, for example, from a 
body sample such as urine, blood, sputum, stool, tissue or saliva. Samples can also be 
derived from a population of individuals. The individuals can be humans, but can be 
any organism, plant or animal, eukaryotic or prokaryotic, viral or non-viral. 

|33| Any type of probe can be used for specific hybridization to the amplified 
polynucleotides which are bound to the beads. Fluorescently labeled probes are 
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useful because their analysis can be automated and can achieve high throughput. 
Fluorescence activated cell sorting (FACS) permits both the analysis and the isolation 
of different populations of beads. One type of fiuorescently labeled probe that can be 
used is a modified molecular beacon probe. These probes have stem-loop structures 
and an attached fluorescent moiety on the probe, typically on one end of the probe, 
sometimes attached through a linker. Unlike standard molecular beacon probes, 
modified molecular beacon probes do not have a quenching moiety. The modified 
molecular beacon probe can have the fluorescent moiety attached on either end of the 
probe, 5' or 3'. One such probe will hybridize better to a wild-type sequence than to a 
mutant. Another such probe will hybridize better to a mutant sequence than to the 
wild type. Still other probes will preferably hybridize to one polymorphic variant 
over another. 

[34] The method of the present invention provides a reliable and sensitive assay for 
measuring variations in genes and transcripts. It requires no instrumentation other 
than machines that are widely available. There are several other advantages of this 
approach. First, the sensitivity can be increased to meet the particular specifications 
of an assay simply by analyzing more beads. Such sensitivity is limited only by the 
error rate of the polymerases used for amplification. Second, the data obtained can be 
used not only to demonstrate that a variant is present in a particular population of 
DNA molecules, but also quantifies the fraction of variant DNA molecules in that 
population (Fig. 5A). Such quantification is not possible with techniques that destroy 
or ignore the wild type molecules as part of the assay, such as those that use allele 
specific priming or endonuclease digestion during PCR. Third, the beads containing 
variant alleles can easily be purified through flow sorting. Such recovery is difficult 
with digital techniques that count molecules deposited on microscope slides. And 
finally, the method is automatable. 

|35| Several modifications of the basic principles described here can be envisioned that 
will further simplify the technology or widen its applications. For example, 
microemulsions were made by stirring water/oil/detergent mixes. The sizes of the 
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resultant aqueous compartments were somewhat heterogeneous, as illustrated in Fig. 
2. A relatively large number of beads containing PCR products of both alleles are 
obtained from large compartments because they are more likely to contain >l 
template molecule than smaller compartments. Though this is not a problem for the 
analysis of uncommon variants, it does pose a problem when the variant to be 
analyzed is present in a substantial fraction of the DNA molecules. For example, it is 
easy to distinguish a population containing 2% of allele A and 98% of allele B from 
one that contains 0% of allele A (Fig. 5A). But it is more difficult to distinguish a 
population that contains 48% of allele A and 52% of allele B from a population that 
contains 50% of allele A; the large number of heterozygote beads formed in the latter 
analysis diffuse the boundaries of the pure red and green channels. This limit to 
accuracy can be overcome through the preparation of more uniformly sized aqueous 
compartments. Sonication or pressure-driven emulsifiers can make more uniform 
compartments. 

[361 Though flow cytometry requires only seconds to minutes per sample, multiple parallel 
analyses could facilitate throughput. Novel particle counting designs may prove 
useful for this purpose. Another way to increase throughput would be to physically 
separate the beads that contained PCR products prior to flow cytometry. This could 
be accomplished with proteins such as antibodies or streptavidin that bind to modified 
nucleotides incorporated into the PCR product during amplification. 

137J The methods of the invention can be applied to genes or transcripts of any organism 
or population of organisms. These include without limitation: humans, rodents, 
ungulates, mammals, primates, cows, goats, pigs, rats, mice, yeast, poultry, fish, 
shellfish, digs, cats, zebrafish, worms, algae. It can also be used to quantify 
epigenetic alterations, such as methylation, if DNA is first treated with bisulfite to 
convert methylated cytosine residues to thymidine. Beads generated from random 
fragments of whole genomes (24), rather than from individual genes as described 
above, could be used to identify gene segments that bind to specific DNA-binding 
proteins (25). And if the product beads are used in compartmentalized in vitro 
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transcription-translation reactions, variant proteins can be bound to beads containing 
the corresponding variant DNA sequences (23). This could allow facile flow 
cytometric evaluation of rare mutations using antibodies that distinguished between 
wild type and mutant gene products (26). 

|381 While the invention has been described with respect to specific examples including 
presently preferred modes of carrying out the invention, those skilled in the art will 
appreciate that there are numerous variations and permutations of the above described 
systems and techniques that fall within the spirit and scope of the invention as set 
forth in the appended claims. 

EXAMPLES 

Example I— Materials and Methods 

1391 Step I - Coupling oligonucleotides to beads. Superparamagnetic beads of 1 .05 +/- 0. 1 
urn in diameter, covalently bound to streptavidin, were purchased from Dynal 
Biotech, Inc. (650.01, Lake Success, NY). Beads were washed once with Ix PCR 
buffer (53286, Invitrogen, Carlsbad, CA) then suspended in Bind and Wash Buffer 
(BWB) (5 mM Tris-HCl, 0.5 mM EDTA, 1.0 M NaCI, pH 7.5). Beads were 
incubated in BWB for 30 min at room temperature in the presence of 10 uM 
oligonucleotides (Supplementary Table 1). These oligonucleotides were modified 
with a dual biotin group at the 5' end with the biotin groups separated by a six-carbon 
linker (IDT, Coralville, I A). After binding, the beads were washed 3 times with Ix 
PCR buffer to thoroughly remove unbound oligonucleotides. 

|40l Step 2 - Preparing microemulsions. Microemulsions for PCR were prepared by slight 
modifications of previously described methods (14) (15). The oil phase was 
composed of 4.5% Span 80 (S6760, Sigma, St. Louis, MO), 0.40 % Tween 80 (Sigma 
S-8074), and 0.05% Triton X-100 (Sigma T-9284) in mineral oil (Sigma M-3516). 
The oil phase was freshly prepared each day. The aqueous phase consisted of 67 mM 
Tris-HCl (pH 8.8), 16.6 mM NH4S04, 6.7 mM MgCI2, 10 mM P-mercaptoethanol, I 
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mM dATP, I mM dCTP, I mM dGTP, I mM dTTP, 0.05 uM forward primer, 25 uM 
reverse primer, 45 units Platinum Taq (Invitrogen 10966-034), various amounts of 
template DNA (see results), and -108 oligonucleotide-coupled beads in a total 
volume of 300 ul. The forward primer was an oligonucleotide whose sequence was 
identical to the 3' 20 - 22 nt of that described in step I and was not modified with 
biotin. 

|41| Water-in-oil microemulsions were prepared by drop wise addition of 200 uL of the 
aqueous phase to 400 ul of the oil phase previously placed in a 2 ml round bottom 
cryogenic vial (430661, Corning, Corning, NY). The drop wise addition was 
performed over ~ one minute while the mixture was being stirred at 1400 RPM with a 
magnetic microstir bar (58948-353, VWR, Plainfield, NJ) on a VWR model 565 
Magnetic Stirrer. After the addition of the aqueous phase, the mixture continued to be 
stirred for a total time of 30 minutes. Two emulsions were made at once by placing 
two tubes in a rack placed at the center of the magnetic stirrer. 

[421 Ste P 3 - PCR cycl ing. The emulsions were aliquotted into five wells of a 96 well PCR 
plate, each containing 100 ul. PCR was carried out under the following cycling 
conditions: 94°C for 2 minutes; 40 cycles of 94°C for 15 seconds, 57°C for 30 
seconds, 70°C for 30 seconds. The PCR products analyzed in this study ranged from 
189 to 239 bp. 

[43] Step 4 - Magnetic capture of beads. After PCR cycling, the microemulsion from five 
wells of a PCR plate were pooled and broken by the addition 80p ul of NX buffer 
(100 mM NaCt containing 1% Triton X-I00, 10 mM Tris-HCl, pH 7.5, I mM EDTA) 
in a 1 .5 ml tube (Corning 430909). After vortexing for -20 sec. the beads were 
pelleted by centrifugation in a microcentrifuge at 8000 rpm (5000 g) for 90 seconds. 
The top oil phase and all but -300 ul of the aqueous phase was removed from the tube 
and 600 ul of NX buffer was added. After vortexing for 20 sec. and centrifugation for 
90 sec, the top oil phase and all but -300 ul of the aqueous phase was removed. The 
addition of 600 ul NX buffer, vortexing, and centrifugation was repeated once more 
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and the top oil portion and all but -300 ul of the aqueous phase was removed. The 
tube was then placed on a magnet (Dynal MPC-S) and the rest of the supernatant was 
« carefully pipetted off. The beads were washed an additional 3 times with Ix PCR 
bufFer using magnetic separation rather than ccntrifugation and finally resuspended in 
100 ul of I x PCR buffer. 

|44| Step 5 - Sequence differentiation. Two oligonucleotide probes were used for each 
reaction. One was 5Mabeled with 6-carboxyfluorescein (6-FAM) and was specific 
for one allele while the second was S'-labeled with biotin and was specific tor the 
other allele. Probes were synthesized by IDT. The 30 ul hybridization reactions 
contained 10 uM of each probe and 5 - 25 million beads inl x PCR buffer. Reactions 
were performed in PCR plates on a thermal cycler by heating to 94°C for 30 seconds 
then cooling to 75°C at a rate of 0.5°C per second, cooling to 45°C at 0.2°C per 
second, and finally cooled to 30°C at 1°C per second. AH subsequent steps were 
performed at room temperature. The reactions were transferred to a 96 well Costar 
plate (Corning 3797) and placed on a 96 well magnet. Beads were collected 
magnetically by exposing them to the magnet for 2 minutes. The supernatant was 
removed and the beads washed 3 times with Ix PCR buffer by pipetting them and 
collecting for two minutes. They were finally resuspended in 100 ul B-PCR buffer 
(Img/mL BSA in Ix PCR buffer). The beads were then incubated for 10 minutes in a 
total volume of 100 ul B-PCR buffer containing 3 ug of Alexa-488 rabbit anti- 
fluorescein antibody (Molecular Probes A-11090, Eugene, OR) and 3 ug of 
Nutravidin labeled with phycoerythrin (Molecular Probes A-2660) in B-PCR buffer. 
The beads were washed three times and resuspended in B-PCR buffer as described 
above. They were then incubated for ten minutes in a total volume of 100 ul B-PCR 
buffer containing 6 ug of Alexa 488- conjugated chicken anti-rabbit antibody 
(Molecular Probes A-2l44i)and 3 ug of biotinylated goat anti-avidin antibody (BA- 
0300, Vector Laboratories, Burlingame, CA). The beads were washed three times and 
resuspended in B-PCR buffer as described above. They were then incubated for ten 
minutes in a total volume of 100 ul B-PCR buffer containing 3 ug of an Alexa 488- 
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conjugated goat anti-chicken antibody (Molecular Probes A- 11039) and 3 ug of 
phycoerythrin-labeled strcptavidin (Molecular Probes S-866). This solution was then 
washed an additional 3 times with Ix PCR buffer and resuspended in 20 ul of I X 
PCR buffer. 

|4SJ Step 6 - Flow Cytometry. The bead suspension was diluted to a concentration of ~ 
106 -107 beads per ml in 10 mM Tris-HCI, I mM EDTA (351-010-131, Quality 
Biological, Inc., Gaithersburg, MD) and analyzed using a LSR instrument (BD 
Biosciences, Franklin Lakes, NJ). The instrument was set up for standard two-color 
analysis using an argon laser and optical filters that distinguished between the two 
fluorescent dyes. No spectral deconvolution was required as the major bead 
populations were well-separated. In some cases, scanning was performed with 
FACScan or FACSCalibur instruments (BD Biosciences), yielding equivalent results. 
Sorting was carried out with a FACS Vantage SE instrument (BD Biosciences). 

1461 Template preparation and sequence analyses. Human genomic DNA was purified 
with DNeasy (69504, Qiagen, Valencia, CA). RNA was purified with Quickprep (27- 
9255-01, Amersham Biosciences, Piscataway, NJ). Reverse transcription of RNA 
was performed using Superscript II reverse transcriptase (Invitrogen 18064014) 
according to the manufacturer's instructions. PCR using genomic DNA or reverse 
transcripts as templates was performed as described (7). PCR products to be used as 
templates for BEAMing or for sequencing were purified with QIAquick (Qiagen 
28104). Sequencing reactions were performed using Big Dye v3.0 reagents (Applied 
Biosystems, Foster City, CA) and analyzed by capillary electrophoresis 
(Spectrumedix 9600, State College, PA). 

Example 2-Results 

(47J Step I - Coupling oligonucleotides to beads. We used streptavidin-beads because of 
the simplicity of coupling biotinylated oligonucleotides to them. Oligonucleotides 
with just a single 5' biotin group were found to dissociate from the beads during 
temperature cycling, while oligonucleotides labeled with dual biotin groups at their 5' 



- 16- 



Patent Application Atty. Docket No.: 001 107.00356 

end (separated by a six-carbon linker) were stable to cycling. As determined by 
fluoroscopic measurements of oligonucleotides doubly labeled with 6-FAM and 
biotin, —105 oligonucleotide molecules were bound to each bead. We found that short 
oligonucleotides (20 bases) did not work as well Tor priming as longer ones (41 bp), 
perhaps because of steric hindrance at the bead surface. It is likely that amino-, 
sulfhydryl-, or carboxyl-modified oligonucleotides covalently coupled to beads 
modified with corresponding reactive groups could also function as bead-bound 
primers for BEAMing. 

|48| Step 2 - Preparing microemulsions. The size of the individual aqueous compartments 
ranged from less than I micron to>10 microns in diameter (Fig. 2). We estimated 
that an emulsion comprising 200 ul of aqueous solution and 400 ul of oil would 
contain ~3 x 109 compartments with an average diameter of 5 microns. 
Approximately 108 beads were included in each emulsion, so that only one in -30 
compartments contained a bead. The optimal amount of template was experimentally 
determined to be ~ 5 x 108 molecules, so that one in -six compartments contained a 
template molecule. 

|49J Step 3 - PCR cycling. PCR priming by oligonucleotides coupled to beads was found 
to be very inefficient compared to the priming by the same oligonucleotides when free 
in solution. For this reason, a small amount of non-biotinylated forward primer 
identical in sequence to the biotinylated oligonucleotide coupled to the beads was 
included in the reactions. This facilitated the first few rounds of amplification of the 
single template within each aqueous compartment. In the absence of additional 
primer, no detectable amplification on the beads was generated. Conversely, if too 
much additional primer was included, no amplification on the beads occurred because 
of competition with the primers in solution. An excess of the reverse primer was 
included in the aqueous compartment to maximize the probability that bead-bound 
oligonucleotides extended by polymerase would serve as templates for further 
amplification cycles. 
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|S0| Step 4 - Magnetic capture of beads. There are several ways to break water-in-oil 
emulsions, including extraction with organics (14). We found that simply adding 
non-ionic detergents produced phase separations without any detectable modification 
of the beads or DN A molecules bound to them. By measuring the amount of DMA 
that could be released from the beads following restriction endonuclease digestion, we 
estimate that > 1 0,000 extended PCR products were present, on average, per bead. 

(51 1 Step 5 - Sequence differentiation. Most fluorescence-based methods for 
distinguishing alleles in homogeneous or two-phase assays can be used to assess 
allelic variation captured on beads. These methods include single nucleotide 
extension, allele specific priming, or hybridization. We generally employed 
hybridization of fluorescein-conjugated or biotin-conjugated oligonucleotides for 
discrimination. As shown in Fig. 1 and Supplementary Table 1, these 
oligonucleotides had a stem-loop structure, with the middle of the loop containing the 
variant nucleotide(s). This design was based on studies of Molecular Beacons 
wherein a stem-loop structure was shown to markedly improve allelic discrimination 
(16). The oligonucleotides we used differed from Molecular Beacons in that there 
was no need for a quenching group. Such quenching is required for homogeneous 
assays when unhybridized oligonucleotides cannot be removed from the reactions 
prior to assay but is not necessary for solid phase assays such as those employed with 
beads. 

|52| Step 6 - Flow Cytometry. Optimum results in flow cytometry depend on high 
fluorescent signals on the beads. We generally enhanced the fluorescence emanating 
from the hybridization probes with secondary reagents. For example, Alexa 488 — 
labeled antibodies were used to enhance the signals emanating from fluorescein- 
coupled oligonucleotide probes. Similarly, phycoerythrin-labeled streptavidin was 
used to generate a signal from biotin-labeled oligonucleotide probes. Flow 
cytometers equipped with two or three lasers and appropriate filters have the capacity 
to distinguish multi-allelic loci and to perform multiplex analysis of several genes 
simultaneously. The newest generation of flow cytometers can also analyze >70,000 



-18- 



Patent Application Atty. Docket No.: 001 107.00356 

events per second. In addition to the analytical power of flow cytometry, FACS 
instruments can separate specific populations of beads for further analysis. 



Example 3- Characteristics of microemulsions. 

|53] Pilot experiments demonstrated that simply stirring the water-oil mixtures described 
in Materials and Methods produced very stable microemulsions of a size compatible 
with that of the beads. In the experiment shown in Fig. 2, the aqueous compartment 
contained a blue dye and 1 micron magnetic beads that were labeled by binding to an 
oligonucleotide that was biotinylated at its 5' end and labeled with fluorescein at its' 
3* end. The appearance of emulsions immediately after their formation is shown in 
Fig. 2. As expected, this appearance was unchanged after temperature cycling during 
PCR (15). Most aqueous compartments contained no beads, as expected from the 
figures provided in the previous section. Those compartments that did contain beads 
generally contained only one, though a fraction contained more, as expected from a 
Poisson distribution and non-uniform aqueous compartment sizes. "Heterozygous" 
beads containing PCR products representing both alleles are produced when two or 
more DNA template molecules are contained within a single aqueous compartment. 
Such heterozygotes can compromise the accuracy of the analyses under some 
circumstances (see Discussion). 

Example 4~ Detection of homozygotes apd heterozygotes. 

f54| Fig. 3 shows typical results obtained with human DNA samples. The MID42 marker 
used in this experiment was chosen from a collection of diallelic short 
insertion/deletion polymorphisms assembled by Weber and colleagues (17). These 
alleles are particularly simple to distinguish with hybridization probes because the 
two alleles at each locus differ by - 4 bases. The probe for the longer (L) allele was 
labeled with fluorescein (green) and the probe for the shorter (S) allele labeled with 
phycoerythrin (red). 
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|55| Fig. 3 A shows a plot of the side scatter vs. forward scatter of beads 'following 
BEAMing. In general, >75% of beads were dispersed as single particles, with the 
remainder aggregated in groups of two or more. Subsequent flow cytometric analysis 
was confined to the singlet beads, gated as outlined in Fig. 3A. 

[561 Figs. 3B - D show density plots of gated beads generated with various templates. In 
Fig. 3B, a template from an individual homozygous for the L allele was included in 
the emulsion. Two populations of beads were apparent. 98 % of the beads contained 
no PCR product (black) and the remaining 2% fluoresced in the FLI channel (colored 
green in Fig. 3). Fig. 3C represents the analysis of an individual homozygous for the 
S allele. Two populations of beads were again apparent, but this time the labeled 
population fluoresced in the FL2 channel (colored red in Fig. 3). Fig. 3D presents 
density plots from the analysis of an individual heterozygous at the M1D42 locus. 
Four populations of beads are evident: the black region represents beads without any 
PCR product, the red region represents beads containing PCR products from the L 
allele, the green region represents beads containing PCR products from the S allele, 
and the blue region represents beads containing PCR products from both alleles. 
Beads containing PCR products from both alleles were derived from aqueous 
compartments which contained more than one template molecule. The number of 
blue beads increased in a non-linear fashion as more template molecules were added. 
At the extreme, when all aqueous compartments are saturated, virtually all beads will 
register as blue. Operationally, we found that the bead populations were most distinct 
when the number of beads containing any PCR product was < 10% of the total beads 
analyzed. 

Example 5- PCR products, genomic DNA or cDNA as templates. 

|571 The results shown in Fig. 3 were generated using PCR products made from human 
genomic DNA samples. As the ratio of the beads representing L alleles to those 
representing S alleles was 1.0 in this experiment, it was clear that the initial PCR did 
not preferentially amplify either allele. The use of PCR products rather than genomic 
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DNA permitted large numbers of alleles to be amplified from even small quantities of 
starting DNA. In general, 10 to 100 pg of PCR products of size 200 bp were found to 
be optimal for BEAMing, producing PCR-mediated extension of primers on -1 to 
10% labeled beads. 

|58| In some situations it might be useful to use genomic DNA rather than PCR products 
as templates for BEAMing. The data in Fig. 4A and B show flow cytometric data 
from an experiment wherein 10 ug or 1 ug of human genomic DNA was used as 
template for BEAMing at the M1D42 locus. Patterns very similar to those shown in 
Fig. 3 were observed, though fewer beads were labeled than when PCR products were 
used as templates. 

|59| BEAMing could also be used to analyze variations in expression from the two alleles 
of a heterozygous individual. Heritable variations in the expression from individual 
alleles of the same gene have been shown to occur often in humans (18) and mice (19) 
and can have significant phenotypic effects (20). The results shown in Fig. 4C and D 
show that PCR products made from reverse-transcribed mRNA can be used for 
BEAMing. In this case, calpain-10 transcripts differing by a single nucleotide 
polymorphism (SNP) were analyzed. For SNPs like these, probes that incorporated 
an extra mismatched nucleotide adjacent to the polymorphic nucleotide (see 
Supplementary Table I) can enhance the distinction between alleles (21) (22). The 
results from two independent emulsions made with aliquots of the same RT-PCR 
product are shown to illustrate reproducibility. Though the number of beads that 
functioned as templates in BEAMing varied up to 3-fold among experiments with 
identical templates, the proportion of beads representing the two alleles was 
reproducible (775 A allele beads to 690 G allele beads in Fig. 4C and 1380 A allele 
beads to 1227 G allele beads in Fig. 4D) respectively). 

Example 6~ Analysis of minor variants in a DNA population. 

J601 The analysis of uncommon variations is ideally suited for analysis via BEAMing 
because of the large number of molecules that can be independently analyzed while 
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retaining a high signal-to-noise ratio. Fig. 5A shows representative data from 
templates representing 1%, 2%, 3%, and 4% of the L allele of MID42. The linearity 
of these measurements, with a correlation coefficient of 0.99, demonstrates the utility 
of this approach for such applications. We also applied this analysis to the detection 
of KRAS and could easily observe 0.1% mutants when spiked into a population of wt 
molecules (data not shown). 

[611 The rare beads representing the mutant alleles could not only be quantified but could 
also be purified for subsequent analysis. As a demonstration, samples of the beads 
enumerated in Fig. 5A were additionally assessed using a flow cytometer equipped 
with sorting capabilities. Beads were sorted and individual beads used as templates 
for conventional PCR using the same primers employed for BEAMing. As each bead 
contains thousands of bound template molecules, single beads were expected to 
generate robust PCR products (23) and this was experimentally confirmed. These 
PCR products were then subjected to sequencing. As shown in Pig. 5B and C, green 
and red beads generated PCR products exclusively of the L and S types, respectively. 
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1 . A composition comprising a plurality of beads, wherein each of said plurality of beads 
comprises a plurality of bound polynucleotides, wherein the polynucleotides in the 
composition are heterogeneous, and wherein on at least 1 % of said beads the plurality 
of bound polynucleotides is homogeneous. 

2. The composition of claim 1 wherein on at least 5 % of said beads the plurality of 
bound polynucleotides is homogeneous. 

3. The composition of claim 1 wherein on at least 1 0 % of said beads the plurality of 
bound polynucleotides is homogeneous. 

4. The composition of claim 1 wherein on at least 50 % of said beads the plurality of 
bound polynucleotides is homogeneous. 

5. The composition of claim 1 wherein the plurality of bound polynucleotides is greater 
than 100. 

6. The composition of claim I which is a liquid. 

7. The composition of claim 6 which comprises agarose. 

8. The composition of claim 1 wherein the polynucleotides in the composition differ by 
a single nucleotide polymorphism (SMP). 

9. The composition of claim 1 wherein the polynucleotides in the composition differ in 
the presence or absence of a mutation. 

10. The composition of claim 1 wherein the polynucleotides in the composition differ in 
the presence or absence of an insertion. 

1 1. The composition of claim 1 wherein the polynucleotides in the composition differ by 
the presence or absence of a polymorphism. 

12. The composition of claim I wherein the beads are magnetic. 

13. The composition of claim 1 wherein at least one species of polynucleotide is labeled 
with a fluorescent dye. 

14. The composition of claim 13 wherein the labeling is via a labeled oligonucleotide. 
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15. The composition of claim 13 wherein the labeling is via one or more labeled 
antibodies. 

16. The composition of claim I wherein the bound polynucleotides were made by 
amplification of a template in a test sample, wherein the beads on which the plurality 
of bound polynucleotides is homogeneous comprise at least a first and a second 
species of polynucleotide, wherein the beads comprising the first species of 
polynucleotide and the beads comprising the second species of polynucleotide are, 
present in the composition in the same ratio as the first and second species of 
polynucleotide were present in the test sample. 

17. A liquid composition comprising a plurality of microemulsions forming aqueous 
compartments wherein at least a portion of said aqueous compartments comprise: 

a bead; 

a polynucleotide template; and 
oligonucleotide primers for amplifying said template; 
wherein at least a portion of the oligonucleotide primers is bound to the bead. 

18. The liquid composition of claim 17 which comprises forward and reverse 
oligonucleotide primers. 

19. The liquid composition of claim 17 wherein said aqueous compartments have an 
average diameter of 0.5 to 50 microns. 

20. The liquid composition of claim 17 wherein at least one in 10,000 of said aqueous 
compartments comprise a bead. 

21. The liquid composition of claim 17 wherein from 1/100 to 1 of said aqueous 
compartments comprise a bead. 

22. The liquid composition of claim 17 wherein from 1/50 to 1 of said aqueous 
compartments comprise a polynucleotide template molecule. 

23. The liquid composition of claim 17 wherein the bead is magnetic. 

24. The liquid composition of claim 17 wherein the average number of template 
molecules per aqueous compartment is less than 1. 

25. The liquid composition of claim 17 further comprising a DNA polymerase and 
deoxy ribon ucleotides . 
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26. The liquid composition of claim 17 wherein the average diameter of said aqueous 
compartments is from 1 to 10 microns, inclusive. 

27. The liquid composition of claim 1 7 wherein the average diameter of said aqueous 
compartments is from 1 1 to 100 microns, inclusive. 

28. The liquid composition of claim 1 7 wherein the average diameter of said aqueous 
compartments is about 5 microns. 

29. The liquid composition of claim 17 wherein each oligonucleotide primer is at least 12 
nucleotides in length. 

30. The liquid composition of claim 17 wherein each oligonucleotide primer is from 25 to 
55 nucleotides. 

3 1 . The liquid composition of claim 17 wherein binding of said oligonucleotide primers 
to said bead is covalent. 

32. The liquid composition of claim 17 wherein binding of said oligonucleotide primers 
to said bead is.via a biotin-streptavidin binding pair. 

33. The liquid composition of claim 32 wherein said forward or reverse oligonucleotide 
primers that are bound to said bead comprise at least two biotin moieties. 

34. The liquid composition of claim 17 wherein the aqueous compartments comprise 
agarose. 

35. A method for analyzing nucleotide sequence variations, comprising: 

forming microemulsions comprising one or more species of analyte DNA 
molecules; 

amplifying analyte DNA molecules in the microemulsions in the presence of 
reagent beads, wherein the reagent beads are bound to a plurality of molecules of a 
primer for amplifying the analyte DIM A molecules, whereby product beads are 
formed which are bound to a plurality of copies of one species of analyte DNA 
molecule; 

separating thevproduct beads from analyte DNA molecules which are not 
bound to product beads; 

determining a sequence feature of the one species of analyte DNA molecule 
which is bound to the product beads. 
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36. The method of claim 35 further comprising the step of isolating product beads which 
are bound to a plurality of copies of a first species of analyte DNA molecule from 
product beads which are bound to a plurality of copies of a second species of analyte 
DNA molecule. 

37. The method of claim 36 wherein the step of isolating is performed using fluorescence 
activated cell sorting. 

38. The method of claim 36 further comprising the step of recovering the first species of 
analyte DNA molecule from the product beads. 

39. The method ofclaim 36 further comprising the step of amplifying the first species of 
analyte DNA molecule from the isolated product beads. 

40. The method ofclaim 38 further comprising the step of determining the sequence of 
the first species of analyte DNA molecule. 

41 . The method ofclaim 35 wherein the step of amplifying converts less than 10 % of the 
reagent beads present in the microemulsions into product beads. 

42. The method ofclaim 35 wherein prior to the step of separating, the microemulsions 
are broken by addition of one or more detergents. 

43. The method ofclaim 35 wherein the step of determining is performed by 
hybridization to oligonucleotide probes which are differentially labeled. 

44. The method ofclaim 35 wherein the relative or absolute amounts of product beads 
comprising one or more sequence features is determined. 

45. The method ofclaim 44 wherein the relative or absolute amounts are determined 
using flow cytometery. 

46. The method ofclaim 35 wherein the step of amplifying employs additional copies of 
the primer which are not bound to the reagent bead. 

47. The method ofclaim 35 wherein the analyte DNA molecules are genomic DNA. 

48. The method ofclaim 35 wherein the analyte DNA molecules are cDNA. 

49. The method ofclaim 35 wherein the analyte DNA molecules are PCR products made 
from genomic DNA. 

50. The method ofclaim 35 wherein the analyte DNA molecules are PCR products made 
from cDNA, 
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51. The method of claim 35 wherein the analyte DNA molecules are derived from a 
single individual. 

52. The method of claim 35 wherein the analyte DNA molecules are derived from a 
population of individuals. 

53. The method of claim 35 wherein the reagent beads are magnetic. 

54. The method of claim 35 wherein the step of determining a sequence feature is 
performed by extension of a primer with one or more labeled deoxyribonucleotides. 

55. A probe for use in hybridization to a polynucleotide that is bound to a solid support, 
comprising: 

an oligonucleotide with a stem-loop structure having a photoluminescent dye 
at one of the 5* or 3' ends, wherein said oligonucleotide does not comprise a quenching agent 
at the opposite 5' or 3* end. 

56. The probe of claim 55 which hybridizes to a wild-type selected genetic sequence 
better than to a mutant selected genetic sequence. 

57. The probe of claim 55 which hybridizes to a mutant genetic sequence better than to a 
wild-type genetic sequence, 

58. A pair of molecular probes comprising: 

a first oligonucleotide with a stem-loop structure having a first 
photoluminescent dye at one of the 5' or 3' ends, wherein said first oligonucleotide 
does not comprise a quenching agent at the opposite 5 ? or 3* end, wherein said first 
oligonucleotide hybridizes to a wild-type selected genetic sequence better than to a 
mutant selected genetic sequence; and 

a second oligonucleotide with a stem-loop structure having a second 
photoluminescent dye at one of the 5' or 3' ends, wherein said second oligonucleotide 
does not comprise a quenching agent at the opposite 5' or 3' end, wherein said second 
oligonucleotide hybridizes to the mutant selected genetic sequence better than to the 
wild-type selected genetic sequence; 
wherein the first and the second photoluminescent dyes are distinct 

59. A method for isolating nucleotide sequence variants, comprising: 
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forming microemulsions comprising one or more species of analyte DNA 
• molecules; 

amplifying analyte DNA molecules in the microemulsions in the presence of 
reagent beads, wherein the reagent beads are bound to a plurality of molecules of a 
primer for amplifying the analyte DNA molecules, whereby product beads are 
formed which are bound to a plurality of copies of one species of analyte DNA 
molecule; 

separating the product beads from analyte DNA molecules which are not 
bound to product beads; 

isolating product beads which are bound to a plurality of copies of a first 
species of analyte DNA molecule from product beads which are bound to a plurality 
of copies of a second species of analyte DNA molecule. 

60. The method of claim 59 wherein the step of isolating is performed using fluorescence 
activated cell-sorting. 

61 . The method of claim 59 further comprising the step of recovering the first species of 
analyte DNA molecule from the product beads. 

62. The method of claim 59 further comprising the step of amplifying the first species of 
analyte DNA molecule from the isolated product beads. 

63. The method of claim 59 farther comprising the step of determining the sequence of 
the first species of analyte DNA molecule. 
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METHOD AND COMPOSITIONS FOR DETECTION AND ENUMERATION OF 

GENETIC VARIATIONS 

Abstract of the Invention 

Many areas of biomedical research depend on the analysis of uncommon variations in 
individual genes or transcripts. Here we describe a method that can quantify such variation at 
a scale and ease heretofore unattainable. Each DNA molecule in a collection of such 
molecules is converted into a single particle to which thousands of copies of DNA identical 
in sequence to the original are bound. This population of beads then corresponds to a one-to- 
one representation of the starting DNA molecules. Variation within the original population of 
DNA molecules can then be simply assessed by counting fluorescently-labeled particles via 
flow cytometry. Millions of individual DNA molecules can be assessed in this fashion with 
standard laboratory equipment. Moreover, specific variants can be isolated by flow sorting 
and employed for further experimentation. This approach can be used for the identification 
and quantification of rare mutations as well as to study variations in gene sequences or 
transcripts in specific populations or tissues. 
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Fig. 1 Schematic of BEAMing. Step 1- Magnetic beads covalently coated with streptavidin 
are bound to biotinylated oligonucleotides ("oligos"). Step 2 - An aqueous mix containing all 
the necessary components for PCR plus primer-bound beads and template DNA are stirred 
together with an oil/detergent mix to create microemulsions. The aqueous compartments 
(white circles in the gray oil layer) contain an average of <1 template molecule and <1 bead. 
Red and green templates represent two template molecules whose sequences differ by one 
or many nucleotides. Step 3 - The microemulsions are temperature cycled as in a 
conventional PCR. If a DNA template and a bead are present together in a single aqueous 
compartment, the bead bound oligonucleotides act as primers for amplification. The straight 
red and green lines connected to the beads represent extension products from the two 
different kinds of templates. Step 4 - The emulsions are broken and the beads are purified 
with a magnet. Step 5 - After denaturation, the beads are incubated with oligonucleotides 
that can distinguish between the sequences of the different kinds of templates. 
Fluorescently-labeled antibodies are then used to label the bound hybridization probes. This 
renders the beads containing PCR product as red or green upon appropriate laser excitation. 
Step 6 - Flow cytometry is used to count the red and green beads. 
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Fig 2 Photograph of a typical microemulsion. Microemulsions were made as described in 
Materials and Methods with the exception that the aqueous compartments contained cascade 
blue-labeled dCTP and the beads were pre-labeled with phycoerythrin (red) or Alexa 488 
(green) One ul of microemulsion was deposited in 1 ul of oil on a microscope slide prior to 
photography. Of the seven aqueous compartments visible in this picture, two contain beads. 
Note the heterogeneous size of the aqueous compartments (beads are 1 .05 microns in 
diameter). 
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Fig 3 Density plots of flow cytometric data obtained from BEAMing. The locus queried in 
this experiment was MID42 and PCR products generated from genomic DNA were used as 
templates in the microemulsions. (A) Forward scatter (FSC) and side scatter (SSC) of all 
beads show that -80% of the total beads are singlets, with most of the remaining beads 
aggregated as doublets. The "noise" is instrumental and is observed with blank samp es 
containing no beads. The instrument output was gated so that only singlets were analyzed 
for fluorescence analysis. The patterns observed from an individual hom ^9°"\ f ^ e ,^ 
allele (A), homozygous for the S allele (B), and heterozygous for L and S (D are shown in 
(B) (C). and (D), respectively. The regions containing beads hybridizing to meL and S allele 
probes are labeled green and red, respectively. The region containing beads that did not 
hybridize to any probe is black and the region containing beads that hybridized to both 
probes is blue. The blue beads arose from aqueous compartments itn which i both types i of 
template molecules were present. The proportion of singlet beads that hybridized to at least 
one of the probes was 2.9%, 4.3%, and 20.3% in (B) to (D), respectively. 
The FSC and SSC plots in (A) represent the same beads analyzed in (u). 
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Fig. 4 Density plots of BEAMing using genomic DNA or RT-PCR products as templates. The 
data in (A) and (B) were generated by including 10 and 1 ug of human genomic DNA, 
respectively, in the microemulsions, querying the MID42 locus. The data in (C) and (D) were 
generated using emulsions that contained ~ 50 pg of PCR products synthesized from cDNA 
of lymphoblastoid cells, querying the calpain-10 locus. The green and red regions 
correspond to the L and S alleles for MID42 and to the A and G alleles for calpain-10. The 
number of beads in the outlined regions containing red or green beads is shown in each 
case. The proportion of singlet beads that hybridized to at least one of the probes was 1 .2%, 
0.6%, 6.8% and 4.2% in (A) to (D), respectively. The outlined regions used for counting in 
(A) and (B) were identical, as were those used for (C) and (D). Beads that did not hybridize 
to any probe were gated out and therefore not evident in the graphs, while the region 
containing beads that hybridized to both probes is labeled blue. 



Pressman etaL. page 13 




Fig. 5 Detection and validation of variants present in a minor fraction of the DNA population. 
(A) Mixtures of PCR products containing 0% to 4% L alleles of MID42 were used for 
BEAMing. Flow cytometry such as that shown in Fig. 3 was used to determine the fraction of 
singlet beads that were red (y-axis). The proportion of singlet beads that hybridized to at 
least one of the probes varied from 3.2% to 4.3%. (B) Beads were sorted with the FACS 
Vantage SE instrument and individual red or green beads were used as templates for 
conventional PCR employing the forward and reverse primers listed in Supplementary Table 
I. Red beads generated only the S allele sequence while green beads generated only the L 
allele sequence. 
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Table I Oligonucleotides 



Locus 

MID42 
MID42 
MID42 
MID42 
MID42 

Calpain-10 
Catpain-10 
Calpain-10 
Calpain-10 
Calpain-10 

KRAS2 
KRAS2 
KRAS2 
KRAS2 
KRAS2 



Oligonucleotide* 

5'-tactatgtatttatacgttaagacctctatgaatgaatgta 
S'-cgttaagacctctatgaatgaatgta 
S'-gaaaggtaagtacagggaaagg 



5* 



5'-cacgcagattgaattaaacagttacaaagacacgtg 



5'-aggtcccagagggtggaaggagccaggacgcacccccactgclgctg 
S'-aggtcccagagggtggaag 
5-ttgcgatggtcactgtgaag 
5'-cacggtaggtgctTgcaggcagcgtg 
5' -cacggtaggtgccCgcaggcagcgtg 

S'-ttcgtccacaaaatgattctgaattagctgtatcgtcaagg 
5-agaatggtcctgcaccagtaa 
S'-catgttctaatatagtcacattttca 
5*-cacgggagctGGTGGCgtagcgtg 
5'-ccacgggagctgatggcgtagcgtgg 

*Bases in upper case represent allelic differences. 

"Hybridization probes each contained 4 bases at their 5' and 3 1 ends to form hairpins, as 
explained in the text. 



M dtfication 


Use** 


5' Dual biotin 


Bound to Beads 


none 


Forward Primer for PCR 


none 


Keverse r* rimer tor run 


5' 6-FAM 


Hybridization probe for L allele 


5* Biotin 


Hybridization probe for S allele 


5* Dual Biotin 


Bound to Beads 


none 


Forward Primer for PCR 


none 


Reverse Primer for PCR 


5' 6-FAM 


Hybridization probe for A altele 


5' Biotin 


Hybridization probe for G allele 


5* Dual Biotin 


Bound to Beads 


none 


Reverse Primer for PCR 


none 


Forward Primer for PCR 


5 1 6-FAM 


Hybridization probe for wt allele 


5' Biotin 


Hybridization probe for mutant allele 
• 
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